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Abstract. 

The KATRIN (KArlsruhe TRItium Neutrino) experiment will be analyzing the 
tritium beta-spectrum to determine the mass of the neutrino with a sensitivity of 
0.2 eV (90% C.L.). This approach to a measurement of the absolute value of the 
neutrino mass relies only on the principle of energy conservation and can in some 
sense be called model-independent as compared to cosmology and neutrino-less 
double beta decay. 

However by model independent we only mean in case of the minimal extension 
of the standard model. One should therefore also analyse the data for non- 
standard couplings to e.g. righthanded or sterile neutrinos. As an alternative to the 
frequentist minimization methods used in the analysis of the earlier experiments in 
Mainz and Troitsk we have been investigating Markov Chain Monte Carlo (MCMC) 
methods which are very well suited for probing multi-parameter spaces. We found 
that implementing the KATRIN x^- function in the COSMOMC package - an 
MCMC code using Bayesian parameter inference - solved the task at hand very 
nicely. 



Analysis of KATRIN data using Bayesian inference 



2 



1. Introduction 

The Karlsruhe Tritium Neutrino Experiment KATRIN [H [2] will be the first beta 
decay experiment attempting to measure the electron neutrino mass with sub-eV 
precision. Presently the experiment is commissioned to start data-taking in 2013/14 
and has a projected sensitivity of 0.2 eV (90% C.L.) to the neutrino mass. 

KATRIN is the successor of the experiments in Mainz |i3j and Troitsk J4j and will 
be using some of the same techniques as those. For the technical details of KATRIN 
see e.g. j5]. 

Strictly speaking when measuring the 'electron' neutrino mass with /3-decay 
spectra, what we get is the socalled kinematic neutrino mass. That is, the incoherent 
sum of neutrino mass eigenvalues weighted by the appropriate entries in the lepton 
mixing matrix: 

n 
1=0 

However, because the mass differences between the active neutrino mass states 
are known to be smaller than KATRINs sensitivity the experiment can effectively 
only see one mass state (the mass squared differences are Am^2 = 8 x 10~^ eV^ and 
Am23 = |2.6 X 10~^| eV^, respectively [6j). This mass state is sometimes called the 
'electron' neutrino mass, but in principle the tritium beta-spectrum could contain the 
signatures of more than one mass state or of couplings to other particles entirely. In 
order to be called truly model independent KATRIN's final data should be analyzed 
also for alternative scenarios - beyond the minimal extension of the standard model. 

Performing an analysis for non-standard couplings to the electron neutrino adds 
more parameter space to the x^-function of the experiment. One should therefore 
consider how an extended analysis should be performed on the KATRIN output in 
order to get reliable results. 

We present here one approach which seems to give several advantages over the 
standard frequentist analysis. In section 2 we describe our analysis methods before 
presenting results for a number of cases in section 3. Finally we give some concluding 
remarks in the last section. 
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2. Methods: PVequentist and Bayesian analysis tools 

2.1. The principles of Mainz and KATRIN data-analysis 

Let us begin by summarising tlie procedures for production and analysis of KATRIN 
spectra as performed by a toy model Monte Carlo and analysis code for KATRIN- 
like experiments [2]. This code has previously been used to forecast the experiment's 
sensitivity to the neutrino mass [7j. 

Because KATRIN has an integrating spectrometer (a consequence of the MAC- 
E (Magnetic Adiabatic Collimation with Electrostatic) filter technique) the beta- 
spectrum must be written as an integral over the electron energy: 

^{E,,mlj-frUEe,qU)dE, (1) 

Here U is the retarding potential of the spectrometer, Ntot is the total number 
of tritium nuclei in the source, tu is the measurement time allotted for a given value 
of the retarding potential and fres is the experimental response function (which in 
turn is a combination of the electron energy loss function of the tritium source and 
the transmission function of the spectrometer). is the theoretical beta spectrum 
rate folded with the electronic final state distribution of molecular tritium. 

A retarding voltage-independent background rate of Bq is now added to Eq. (fl]): 

Nb = Bo-tu . (2) 

This gives us the following theoretical expression for a KATRlN-like spectrum: 

NthiqU, Eo, mlj = N,{qU, E,, mlj + N, . (3) 

Individual spectra (to resemble the real measurements) are built using 
initial parameters m^^ q and i?o,o for the neutrino mass squared endpoint energy, 
respectively. To the theoretical expression is then added a random component from 
a gaussian distribution with a{qUi) = A/iVTTiVb and fi{qUi) = Ns + Nf,: 

Ne^piqU) = NsiqU, Eo,o, J + A^^ + Rnd {Gauss{a, /i)) . (4) 

When we want to fit our randomized beta-spectra we have to account for 
statistical fluctuations by allowing the overall amplitude A of the signal as well as 
the background rate B to vary against the theoretical amplitude Aq and background 
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energy Eq to deviate from the initial parameters of the simulation m^^ q and -Eo,o- 



9 ^ NJqU,Eo,ml ) Nh 

NfuiqU, A, B, Eo, mlj = A ■ ' + B ■ ■ (5) 

Combining Eq.'s (g]) and ([5]) we finally get KATRIN's x^-function [2]: 

X (A, 5, Eo, - ^ -^-^^^^ J . (6) 

The analysis of the simulated data can be performed with Minuit2 which is 
imbedded in the ROOT-package. This procedure performs a minimization of the 
X^-function using CombinedMinimizer. CombinedMinimizer in turn uses either an 
evaluation of the covariance matrix or a simplex method to find the best minimum 
of the x^-function in the parameter space j8]. One can now do a standard frequentist 
analysis to find the statistical uncertainty on e.g. the neutrino mass by producing 
a suitable amount of Monte Carlo spectra, performing the minimization for each of 
them and finally inspecting the resulting histograms. An example is shown in Figure 
[T]for 12860 spectra produced with m^j = 0.0 eV. 

However as previously indicated the minimization approach has a number of 
drawbacks. For one thing it does not give any information on multiple minima, 
and it is not well suited for finding shallow minima. Furthermore extracting detailed 
information on correlated parameters is pretty laborious. Still the method works just 
fine for the four well-known free parameters used in a standard KATRIN analysis - 
see Table [H But as one adds more parameters the minimization procedure often 
becomes problematic and rather slow. 



2.2. Bayesian parameter inference with COSMOMC 

As an alternative approach we have considered Markov Chain Monte Carlo and 
bayesian inference techniques as in the publicly available COSMOMC analysis 
package for cosmology. Typical cosmological models contain ~ 8-12 parameters 
and COSMOMC is well suited for relatively fast analysis of such multiparameter 
spaces [9j. The programme is built for analysis of large cosmological datasets such as 
CMB data from WMAP and supernova surveys but can in principle analyse whatever 
dataset the user provides - the cosmology can be 'turned off' if it is irrelevant. 
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Figure 1: An example histogram depicting the neutrino mass-squared values from 
minimizations of 12860 heta-spectra for a KATRIN-like parameter set and an assumed 
value for the neutrino mass of m^^ = measured with an optimized time distribution over 
the last 25 eV of the beta-spectrum, e.g. compare to J^. 



COSMOMC uses bayesian statistics for the analysis. When doing socalled 
bayesian parameter inference one is interested in knowing the posterior probabihty, 
P{6\D, M) - the probabihty of the parameters, 6, given the data D and the model 
M. The 'inverse' question is for the probability of the data, D, given the parameters 
and the model, P{D\6, M) - this is simply the likelihood function. With these two 
probabilities and the well-known Bayes theorem. 



P{A AB)= P{A) ■ P{B\A) = P{B) ■ P{A\B), 



(7) 
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one can write an expression for the posterior probability: 

Here L is the hkehhood - which can be easily derived from the x^-functioiJi|. 
The posterior probability is thus proportional to the likelihood. 

Meanwhile 'k{9\M) is the socalled prior probability sometimes referred to as the 
subjective input - it is what we believe we know from theory before even taking the 
data into account. Correspondingly this probability has no dependence on the data. 
Note that we have been using flat priors on all input parameters in this paper. 

Finally e{D\M)^ the evidence, is in effect only a parameter-independent 
normalization constant [§|[10j. 

When we want to know best-fit values and confidence levels of specific 
parameters we can then simply integrate over all the remaining (nuisance) 
parameters. This is called marginalization and the output is called the marginalized 
probability for the parameter of interest. 

In addition to this rather convenient production of parameter probability 
distributions, from the bayesian inference approach, COSMOMC gives us another 
great advantage by using a Markov Chain Monte Carlo (MCMC) to probe the 
parameter space. This will provide a very thorough and easy to inspect mapping of 
the parameter space of interest. 

The purpose of the MCMC is to probe the whole parameter space in a 
randomized manner. To achieve this one implements the Metropolis Hastings 
algorithm [TT] consisting of three main steps: 

• Firstly an initial point, 6q is chosen. 

• Secondly a step is proposed in some random direction, after which the new point 
is evaluated: P{Oi + 6p). Here 6i means the iterative point, and 6p is the proposed 
addition taken from some proposal density. 

• Finally the procedure decides whether or not to take the step. The point 9i + Op 

X The likelihood function L is connected to the x^-function as in the following way: L — 
exp(-xV2)- 

§ However, it can be important in some contexts, e.g. in comparing two qualitatively different 
models 
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is accepted if the posterior probability is improved. That is if 

^Wj^' > 1. (9) 

If the expression above is < 1 the step is accepted with some probabihty r (rejected 
with probabihty 1 — r). In this manner we generate a set of points {Oi}, also called 
a Markov Chain. For the number of points, A^, going to infinity we thus have a 
representation of the posterior probability. 

The decision procedure of the Metropolis Hastings algorithm allows the chain 
to wander away from any local minima and thus potentially discover other minima 
(to a degree determined by the value of i|Jj]. On the other hand it also guarantees 
that the parameter space near the minima is very well probed. Furthermore one can 
perform the analysis on a combination of multiple chains - all started at random 
positions - and get an even better picture of the behavior of the different parameters 
in the allowed intervals. To get rid of un-physical effects from the random starting 
points one normally allows for a burn-in - i.e. the first part of the Markov Chain is 
removed. In our case the burn-in is 50% of the sample size. 

Before running the programme one must carefully choose stepsizes and 
parameter ranges. Several settings in both the COSMOMC programme as such 
and in the parameter files can be tweaked to fit ones purpose. 

Unfortunately it is in principle not possible to determine in any absolute 
terms whether or not a specific chain has converged [H], but various convergence 
diagnostics have been developed. For instance when analyzing multiple chains a 
convergence parameter, R, defined as the variance of the chain means divided by the 
mean of the chain variances, can be evaluated. If 1 — i? is less than some chosen 
small number (in our case 0.03) this information is interpreted as good convergence. 

When COSMOMC has generated the chains we need, the data analysis is 
performed giving us best-fit values and standard deviations for all the parameters. 

Additionally COSMOMC produces a number of useful Matlab-files which can 
be used to produce ID and 2D plots of the marginalized distributions. Inspecting 
this graphical output allows us to determine if the chains have really converged, 
whether there are multiple minima and perhaps most importantly it shows parameter 
correlations right awajull. 

II In our case r is defined as e'^-^ ^^"^ , with temperature, T = 1. 

% In fact COSMOMC - or rather GetDist - produces a multitude of diagnostics files in addition to 



Analysis of KATRIN data using Bayesian inference 



8 



If we go through all of this for say a single Monte Carlo generated beta spectrum 
we get all the nice advantages mentioned above. But the analysis of that one 
spectrum would take many hours as compared to minutes or seconds with the Minuit2 
procedure and we would mostly just have achieved a much slower evaluation of the 
best fit values for that particular spectrum. However if we in stead use the theoretical 
beta-spectrum (which should represent the average of infinitely many measurements 
or Monte Carlo realizations) as our input data - but with Monte Carlo generated 
errorbars - our best fit values and standard deviations from COSMOMC should 
correspond to the results of the frequentist approach of building histograms for a 
very large (going to infinity) number of measurements. 

To recap we implemented our x^-function for KATRIN-like experiments in COS- 
MOMC and simply turned off cosmology. As data-set we have used the theoretical 
spectra - for any given model - with the Monte Carlo generated errorbars of the 
original code. The results will be discussed in the following section. 

3. Results 

3.1. The minimal model 

As a first test of our methods we have attempted to reproduce the KATRIN 
sensitivity. We thus generated a tritium beta-spectrum using as input so far only 
four parameters: The electron neutrino mass squared m^^ q, the endpoint of the beta- 
spectrum -Eo,c0, the background count rate Bq and the signal count rate near the 
beta-spectrum endpoint, Aq - see eq. ([3]). 

The input signal count rate can be calculated as a combination of the column 
density of the source and the magnetic fields and cross sections of the spectrometer 
and source. In our case the count rate near the endpoint Eq is included in the 
analysis code via an amplitude factor Aq as in eq. ([5]). The exact definition and 
full calculation of this factor Aq is included in Appendix A of reference [12j. Given 
KATRIN's experimental settings the amplitude has the value A^ = 477.5 Hz. 

the graphical output. More information on these can be found on the COSMOMC homepage [9]. 
In addition, it is fairly easy to edit the Getdist programme to produce output files that fits ones 
purpose 

+ To suppress the number of digits, we rather plot and write its deviation from 18575 eV - i.e. AEq 



Analysis of KATRIN data using Bayesian inference 



9 



Parameter 


unit 


typical input value 




eV2 


0.0 - 1.0 




eV 


0.0 


5o 


Hz 


0.01 


Ao 


Hz 


477.5 



Table 1: KATRIN standard analysis parameters. Please note, that we show and plot the 
deviation of the value of the endpoint of the beta-spectrum from the theoretically expected 
value: Eq = 18575 eV. 



We would like to remark, that the value of the endpoint energy Eq needs to 
be treated as a free parameter to produce realistic fits with respect to fitting of 
m^^. Up to now the ^He - ^H mass difference is known from precision Penning trap 
experiments with 1.2 eV precision |13| . but already the fits of the experiments at 
Mainz and Troitsk |3] would have needed a much more precise input value to 
justify keeping Eq fixed in the fit. 

The values of our free parameters and the are listed in Table [1] 
The COSMOMC results for a theoretical spectrum with m^^^Q = 0.0 eV are 
presented in Figure [2l Clearly the chains have converged nicely in this case and the 
parameters seem well-constrained. The output values of our analysis are 



ml^ = (-0.41 ■ 10"^ ± 0.013) eV^ 
AEo = (0.87 ■ 10"^ ± 0.22 ■ 10"^) eV 
B = (1.00- 10^2 ±0.15- 10"^) Hz 
A = (477.0 ±0.16) Hz, 



We note that our analysis gives a statistical error on the neutrino mass squared 
of 0.013 eV^, but the statistical uncertainty from the frequentist analysis shown in 
Figure [His 0.016 eV^ and thus ^ 23% larger than our bayesian result Q 

* One can calculate KATRINs sensitivity to the electron neutrino mass at 90% C.L. using the 
following equation (and assuming gausssianity): 
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Figure 2: A one-neutrino analysis witli input mass niy^fi = 0.0 eV. In the COSMOMC 
output the contours (dotted lines) mark the hlielihood function in the 2D (ID) - 
distributions. Color (full lines) mark the data-point distributions. The results have 
converged nicely and gives = —0.41 • 10~^ ± 0.013 eV. 



As a test we therefore calculated the statistical uncertainty for a 11 different 
values of the neutrino mass in the frequentist approach. A comparison with bayesian 
results are presented in Table [2J We see that we get systematically higher values of 

Here cr(Q((m^) = a^^^^. + a^y^, and KATRIN's systematic error on the neutrino mass squared is the 
0.017 eV^ quoted in [2]. The bayesian analysis gives a^^^f = 0.013 eV^ and a sensitivity of 0.19 eV 
(90% C.L.) on the electron neutrino mass. 
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the statistical uncertainty when using frequentist methods for the calculation. 

If we try instead to compare the chi-squared values of the two methods the 
results are more encouraging. We performed the analysis on 10 different Monte Carlo 
generated spectra keeping the mass square fixed at eV^ (the true input value) and 
0.04 eV^ (the sensitivity squared) respectively. This gave identical chi-squared values 
as shown in Figure El The relative deviations between the methods were no larger 
than ~ 2 ■ 10"^ for ml fixed at eV^ and ~ 0.5 • 10"^ for the ml fixed at 0.04 eV^ 

60 I 1 1 1 1 1 1 1 1 1 




123456789 10 

# Monte Carlo realisation 

Figure 3: The value as calculated in COSMOMC (using with flat priors for all input 
parameters) and ROOT for ten different Monte Carlo realizations of KATRIN-like spectra 
(36 d.o.f). The results coincide both when the analysis is performed with the mass square 
fixed at the true input value (ml^ = eV^ ) and when the mass square is fixed at KATRINs 
sensitivity squared (ml^ = 0.04 eV^). 

We are therefore led to believe that the difference in (Jstatitril) is simply down 
to the use of bayesian instead of frequentist statistics. The procedure with which 
we find the standard deviations of the parameters (through a marginalization over 
nuisance parameters) is very different from a frequentist approach. Perhaps it is not 
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m^, [eV] 


0.0 


0.1 


0.2 


0.3 


0.4 


0.5 


0.6 


0.7 


0.8 


0.9 


1.0 


(TstatMy [10"^ eV^] 


1.31 


1.34 


1.38 


1.39 


1.45 


1.51 


1.53 


1.59 


1.61 


1.66 


1.69 


(Tstatjre [l^'^ eV^] 


1.64 


1.70 


1.79 


1.95 


1.88 


1.91 


1.93 


2.04 


2.02 


2.09 


2.21 



Table 2: The statistical uncertainty on the neutrino mass squared for 11 different input 
values. The second row has been calculated using the bayesian approach in the COSMOMC 
analysis, while the third row has been calculated in the usual frequentist approach assuming 
a gaussian distribution function. 



too surprising that we do not produce a curve with the exact same specifications as 
the histogram of Figured] We do however get the same x^-value as well as the correct 
outputs and certainly the right trend in the behavior of cf statif^t) ■ Furthermore we 
note that our results make no use of any gaussianity assumptions. In conclusion this 
bayesian approach seems to produce robust results. 

3.2. Sensitivity plot 

As a first application of our bayesian analysis formalism we have built a sensitivity 
plot for KATRIN-like experiments. Or rather an illustration of the behavior of the 
statistical uncertainty on the neutrino mass squared as a function of key experimental 
settings. 

The sensitivity to the electron neutrino mass of a KATRIN-like experiment 
depends on the signal strength, the background count rate and the energy resolution 
of the experiment. Given the fairly well understood effect of the background on the 
sensitivity we investigate only the effect of the the signal strength and the energy 
resolution - the specific values used are listed in Tables [3] and |H 

Additionally we include an optimization of the measurement time distribution 
for each of our KATRIN-like experiments (as specified by their amplitude and energy 
resolution). This optimization has in fact a great deal of infiuence on the reachable 
sensitivity of such an experiment. Currently KATRIN is projected to have a runtime 
of three years but because of experimental stability issues the measurements are 
performed as a relatively fast scan over the electron energies of interest (or rather 
retarding voltages) of total duration 966 s. The measurement time allotted to each 
data point, the tu in eq. [T], has been carefully optimized for KATRIN's experimental 
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Amplitude [Hz] 


2.0 


4.0 


8.0 


16.0 


32.0 


64.0 


125.0 


250.0 


500.0 


1000.0 



Table 3: Amplitudes investigated in the sensitivity plot presented in Figurel^ 



Energy resolution [eV] 


0.5 


1.0 


2.0 


4.0 



Table 4: Energy resolutions investigated in the sensitivity plot presented in Figure\^ 

settings as described in [2]. 

The basic structure of the measurement time distribution can be represented as 
three segments around the region of the beta spectrum endpoint. 

• Firstly one needs measurements up to about 10 eV above the endpoint of the 
beta-spectrum Eq to determine the correct background. 

• Secondly the 'bulk' region of interest below the endpoint of the beta-spectrum Eq 
must be treated carefully. Effectively this is the section we optimize. 

• Thirdly, previous investigations by the KATRIN collaboration have pointed out 
that there exists a region of maximal sensitivity to the neutrino mass. This region 
is centered around the electron energy Ee for which the signal counts equals 2 
times the background count rates: A^^ = '^Nf, [H]. Above this narrow region 
the total count rate is dominated by background noise. Below it the sensitivity 
to the neutrino mass drops as one goes away from the endpoint. Therefore 
extra measurement time is devoted to measurements in this interval. Given the 
projected KATRIN background count rate of 0.01 Hz the total count rate of the 
critical point must be 0.03 Hz. 

As a rule we construct the background-block of our distribution as 10 points 
separated by 1 eV each having tu = 60 s. The main block contains a total of 
30 points with varied spacing (to be optimized) and tu = 40 s. Finally 60 s are 
added to tu for 0.02 Hz < Nth{U) < 0.04 Hz. We find that the 40-100-60 block- 
structure does a good job of simulating both the structure and total duration of the 
measurement time distribution while still being simple enough to manipulate. For 
the sake of comparison we show the fully optimized KATRIN measurement time 
distribution and one of our simplified distributions in Figure HI 

To build the sensitivity plot we let a script evaluate CTstati^te) ^'^^ 
combination of all energy resolutions given a specific amplitude. The evaluation 
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Data point intervals [eV] 


0.1 


0.2 


0.4 


0.8 


1.0 


1.2 


1.4 


1.6 


1.8 


2.0 



Table 5: The allowed data point intervals used in the main block of our measurement time 
distributions. 



Measuring time distribution 









0.2 

0.18 

0.16 

0.14 

So.12 
0} 

.i 0.1 
1- 

0.08 
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Figure 4: The black squares represent the standard KATRIN measurement time 
distribution and the red triangles show an example of a simplihed time distribution with 
measurement points separated by 1 eV in the main block. This particular time distribution 
contains only one point in the range: 0.02 Hz < NthiqU, Eq, m'^J < 0.04 Hz. 



include an optimization of the measurement time distribution: For the first energy 
resolution we find the best statistical uncertainty using all the available distributions 
(as specified by their data point intervals in the main block - see Tabled]). We then 
impose a time saving condition stating that the next energy resolution is allowed 
only to use the previous best distribution and its closest neighbors. And so on for 
the remaining energy resolutions. The reason for doing this 'spline' between energy 
resolutions and not between amplitudes is the expectation that the largest sensitivity 
fluctuations will take place in the amplitude direction. 

We present the average of 10 sensitivity plots as our result in Figure [5]- including 
points corresponding to the statistical uncertainty of the Mainz |3] and Troitsk |3] 
experiments. The Figure shows a clear log- log dependence of CTstati^lJ on the 
amplitude. The dependence on the energy resolution on the other hand is very 
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weak. Making a fit to the plane of tlie sensitivity plot gives us: 

logio(a,tat(m2j) = 0.058 ■ \og^^{5E) - 0.70 ■ logio(Amp) + 0.0038. 

That is roughly a factor 12 stronger dependency on the amplitude than on 
the energy resolution in this fit (with i?j^j=0.9989). We should note, that these 
simulations are done without considering the corresponding systematics, which would 
most likely give a stronger dependence on the energy resolution AE. 

Further it can be noted that the statistical uncertainties from the Troitsk and 
Mainz experiments (respectively 2.5 and 2.2 eV^ - taken from their final results) are 
somewhat above the plane. This is as could be expected given the fact that the plot 
was built specifically from a KATRIN toy model. And as mentioned above we get 
systematically lower statistical uncertainties using our bayesian analysis algorithm 
than from corresponding frequentist methods. 

Finally we noted a tendency in our procedure to choose large data point 
separations for the 'optimal' measurement time distribution. This effect - a lower 
statistical uncertainty caused by analyzing a larger energy interval below the Q- value 
- was already demonstrated in Figure 19 of |T]. For our purpose we have kept the 
systematic uncertainty on the 0.017 eV^ as projected for the KATRIN experiment 
[2j and our sensitivity so to speak depends only on the statistical uncertainty. 

3.3. Sterile neutrinos 

After these initial tests of our COSMOMC extension we have tried adding further 
parameters in our routine. 

As previously mentioned it is obvious that KATRIN can not resolve the mass 
squared differences between the known active states - of Am^2 = 8 x 10^^ eV^ and 
Am23 = |2.6 X 10~^| eV^, respectively. However sterile neutrinos with mass states in 
the eV range could in principle mix with Vf.. Such neutrinos would provide a much 
better target for direct detection in beta decay experiments than the active neutrinos 
which are expected to have sub-eV masses. Their relatively high mass would allow for 
an easy separation from the primary decay signal in experiments such as KATRIN. 

Recently the MiniBooNE collaboration confirmed their previous findings and 
more indications of a fourth mass state can be found in the so-called reactor anomaly 
[261 |27] . Even cosmology suggests an effective number of neutrino species slightly 
larger than three |T9| [20] |2T| 122] . We have therefore performed a more thorough 
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Figure 5: Our final sensitivity plot result showing the statistical uncertainty on the 
neutrino mass squared as a function of amplitude and energy resolution for a KATRIN-like 
experiment. It is very clear from the figure that (J statiT^t^) is strongly dependent on the 
signal count rate (as could be expected) and to a much lesser degree on the energy resolution, 
5E. For comparison the figure also show the corresponding statistical uncertainties of the 
Troitsk, Mainz and KATRIN experiments (The KATRIN-point was calculated with the 
COSMOMC approach presented here). 

investigation of the possible detection potential for sterile neutrinos by KATRIN-like 
experiments. For the total results see [12j and references therein. 

Here we will present only our main results for effectively a 1+1 (active + sterile) 
neutrino scenario. We can justify having only one active neutrino by the known mass 
squared differences. We kept the active neutrino massless and performed the analysis 
for a broad range of sterile neutrino masses. For KATRINs standard settings (see 
Table [T]) we get the result presented in Figure O 
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Figure 6: The upper Egure shows the sigma detection potential of the massive sterile 
neutrino for the standard KATRIN-like settings, while the lower shows the corresponding 
statistical deviation (in eV^) on the massless neutrino. The x- and y-axis depicts the 
logarithm of the sterile mass squared and the mixing weight. The red mesh illustrates 
the 3a level and the standard one-neutrino statistical uncertainty of around 0.013 eV^ 
respectively (for this analysis method). As one would expect the mass and the mixing 
weight must he rather high in order to get a good detection of the sterile component. 

We found that KATRIN should be able to perform a 3a detection of any of the 
heavy mass states we used as long as |t/esp ^ 0.055. Likewise, Correspondingly a la 
detection is achievable for |[/esP ^ 0.018. 
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5.^. Right handed currents 

As a final application we take a look at righthanded currents. We use the notation 
of Stephenson et al in |25] and let h parametrise the strength of the right-handed 
interaction. We define h = prcos6r/ cos 6, where cos^ is the mixing angle from the 
mass eigenstate to the left-handed current weak eigenstate and cosO^ is the mixing 
angle from the mass eigenstate to the corresponding right-handed current weak 
eigenstate. Again we assume for simplicity that the mass of the electron neutrino can 
be described by one effective mass eigenstate. pn is the ratio of the effective strength 
of interactions mediated by righthanded currents to the strength of interactions 
mediated by the well-known lefthanded weak current. Then the differential beta 
spectrum is modified in the following way in the presence of righthanded currents 



From many data of weak precision experiments reviewed in reference [16J Bonn 
et a/ derive an upper limit on \ ^^ \ of 0.31 (99.7% C.L.) [15] translating to \b\ < 0.16 
in our parameterization 0. It is clear from Eq. [TT] that the mass and the coupling 
parameter, b, are strongly correlated. Combined with the well known correlation 
between the neutrino mass squared m^^ and the endpoint energy AEq [5j this will 
propagate to an additional b - AEq correlation. Figure [7] shows a COSMOMC output 
for all the parameters in a model with input values bo = —0.13 and rriu^^Q = 0.4. The 
input parameter ranges were ml^ o^^-5 eV^ and —2.5 < bo < 2.5, respectively. We see 
that even within these rather large conservative intervals, which were investigated, 
neither m,y^ or b can be determined well. 

As another example Figure [8] shows the behavior of the m^y^ - b correlation for 
a range of masses and bo = ±0.13. Funnily enough for small input masses this 
allows for a better (J statiiT^t^) than in the purely lefthanded case, if the right-handed 
coupling constant b would be known. We believe this is caused by the large available 

tJ Note that the last term is equivalent to 2 times the similar expression in |15| in the limit of small 
b. We wish however to use Eq. [TT] retaining the physical meaning of b, rather than treating the 
coupling as an effective parameter 

ff However it is clear that if | y^^tI < 0.31 can also give the limit |6| > 6.3 and obviously the effect 
of 6 ^ and |6| 3> 1 would give very similar spectra using Eq. [TlJ However b > 1 has no physical 
meaning cf. our definition 



125J3: 




(11) 
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Figure 7: A typical COSMOMC output for a model with bo = -0.13 and m^^fi = 0.4 
eV. One clearly sees the correlations between m^^, b and AEq. Despite quite large input 
parameter ranges for and b neither is well constrained. 



phase-space in the 6-direction. This means the hkehhood can be made very narrow 
around the input-value, while still providing many valid solutions. However we also 
notice that when KATRIN's sensitivity is reached the uncertainty on the mass will 
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be determined again by the experimental limitations and not by numerical solutions. 
Therefore the correlation between and h reappears. 





-1 X Coupling strength, b Coupling strength, b 



Figure 8: The left figure sliows marginalized COSMOMC 2D likelihood contours and 
MCMC data for models with bo = —0.13 and m^^^ ranging from 0.1 eV to 0.9 eV (going 
from the upper left corner to the lower right corner). The right figure has bo = 0.13 and the 
same range of input masses. The correlation between and b only establishes itself when 
the neutrino mass is larger than KATRINs sensitivity of 0.2 eV. This allows for smaller 
(T stati'm'^^) than in the case of bo = because the MCMC-routine lets the uncertainty on 
the mass fill the extra parameter dimension (b). 



Given these disconcerting initial results we investigated how large an influence 
the presence of righthanded currents might have on the output neutrino mass. We 
have performed the full COSMOMC analysis for the parameters given in Table [6l 
For 6 = the analysis was the standard analysis, i. e. without a 6-dimension. Our 
main results are presented as a relative bias compared to the 6 = -case (with the 
exclusion of m^^ = in the mass bias, to avoid inflnities): 

= 100 ■ (12) 

The results are shown in Figures [9] and [TOl Note that we included 6=0.19 to 
create a better overview of the behavior of mf, . 

The results shows us flrstly that the output mass values fluctuate rather wildly - 
and in some cases deviate my as much as ^ 80 % from the input values as shown in the 
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Table 6: Input neutrino masses and righthanded coupling strengths used to produce 
Figures 



Relative bias on m [%] Relative bias on o (rr? ) [%] 



E 

^ 0.5 




-0.15 -0.1 -0.05 0.05 0.1 0.15 -0.15 -0.1 -0.05 0.05 0.1 0.15 



input coupiing strength, b input coupiing strength, b 

Figure 9: The left figure shows the bias on rriy^ as compared to the case without 
righthanded couplings and the right figure shows the bias on a statifnt^) ■ This analysis 
includes the righthanded coupling strength as a free parameter. The bias on the mass is as 
large as 80% while the values of agtatin^uj -'^ times as large as for the standard 

case (barring the parameter range below KATRINs sensitivity where the uncertainty on 
the mass parameter migrates into the b-dimension in the MCMC). 



left panel of Figure |9l And secondly the statistical uncertainty is up to 5 times larger 
than in the standard case except in regions where rriu^ < 0.2 eV as expected from 
the discussion above. Turning to at the output values of the righthanded coupling 
strength in Figure [TO] we get appallingly bad results especially in the m^^^ = 0.2 eV - 
region. From the left hand picture of Figure [TO] one might get the impression that the 
output value of b is returned rather nicely for the larger masses. However as shown 
in the right panel of Figure [10] the relative error is still up to ^ 60% in some regions. 
In conclusion we see from these numerical artifacts that it is extremely difficult to 
get a good determination of both the mass and the coupling strength. At least when 
using fairly large parameter intervals. Given stronger limits the situation would no 
doubt change. But judging from our COSMOMC contours the values in some cases 
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Relative bias on b [%] Reiative bias on b [%] 




-0.15 -0.1 -0.05 0.05 0.1 0.15 ' -0.15 -0.1 -0.05 0.05 0.1 0.15 



Input coupling strengtti, b Input coupling strengtti. b 

Figure 10: The left figure shows the hias on h for the full parameter range of Tat>le\^and 
the right Egure is an enlarged version of this plot for nxy^ > 0.5 eV. In the left figure we 
see that we get the output values wrong by more than a factor 10! This is exacerbated 
at mass values just above the KATRINs sensitivity once again demonstrating how the 
uncertainty on these two ill-determined parameters is redistributed in the parameter space 
of the Markov Chain. The right Egure shows us that when we look beyond the much larger 
errorbars around m^^ = 0.2 eV the output values still fluctuate with errors of order w 60%. 



will be pressed to the largest allowed parameter values even when the intervals are as 
broad as here. In other words - tighter parameter values in this case merely amounts 
to a manual setting of the allowed size of the statistical uncertainties. 

Next we perform the analysis on the same spectra without including the 
righthanded coupling strength to get an idea of the bias imposed on the neutrino 
mass in the presence of unaccounted-for righthanded currents. We present our results 
in Figure [TT] 

As it turns out we get much better results when we remove the 6-dimension from 
our COSMOMC setup - this time the bias on the mass is no larger than around 12 
%. We notice however that the statistical error drops steeply for high masses and 
coupling strengths. Inspecting the original COSMOMC likelihood contours we see 
that this is because AEq has been pushed to the edge of the input interval as shown in 
Figure [121 This also explains why the bias flips in the same parameter-range instead 
of becoming monotonically larger for maximal coupling strengths. The propagation 
of the uncertainty on b into the Ai^Q-dimension is straightforward from the already 
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Relative bias on m^^ [%] 





-0.05 0,05 

input coupiing strengtti, b 



-0.05 0.05 

input coupiing strengtti, b 




Figure 11: This Egure sliows tlie same biases as Figure [9| hut liere tlie analysis lias 
been performed (on the same spectra) without the inclusion of the righthanded coupling 
strength. Clearly the errors on m^^ are much better and for realistic b-values certainly 
within acceptable zizW%-ranges. However we note that agj-„i ™2 is ~ 60% better in the 
high-b, high-m corners of the right-side plot. This coincides with a turnover of the bias on 
the mass in the left-side plot. Figure [T2| shows that this behavior takes place because the 
AEq parameter is being pushed to the maximally allowed values, which should be avoided. 
That is the uncertainty on the mass due to the presence of b is migrating into the third 
correlated parameter - the endpoint of the tritium beta spectrum. 



discussed correlations between the b, m^^ and AEq - parameters. Hopefully the 
upcoming much more precise ^H-^He mass measurements [Ul will be helpful in 
resolving this issue for the KATRIN experiment. 

In conclusion the bias induced on the neutrino mass is now within acceptable 
bounds and agree well with the results found by Bonn et al jl5]. Finally it should be 
noted that an experiment such as KATRIN can clearly not be used to put bounds 
on the size of the righthanded coupling strength at this point. A precise knowledge 
of the neutrino mass and the tritium beta-spectrum endpoint Eq would be have to 
be presupposed before measurements of the tritium beta spectrum could be used to 
determine b. 
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Figure 12: The figures shows the 2D lilielihood contours of AEq vs. m^^ for the mass 
range 0.1 eV to 0.9 eV (again going from the upper left corner to the lower right corner) 
when the analysis is performed without the inclusion of b. The figure on the left used 
spectra that was produced with b = —0.19 while the hgure on the right is for b = 0.19. 
The expected output for AEq is zero, but it is clear to see that in this case the b, ni^^, AEq 
-correlation pushes the uncertainty induced in the mass parameter by the physical presence 
of b into the AEq -parameter instead. 

4. Conclusions 

Our attempt at an analysis of simulated KATRIN data with various additional 
parameters has shown the following: For the standard case of analysis with regard 
to one neutrino mass, the MCMC approach is certainly well suited and gives robust 
results. 

The method is very practical when performing analysis for non-standard cases 
because the COSMOMC output lets us inspect the behavior of the parameters and 
their relation to one another in a straightforward manner. We have used the method 
to build a sensitivty-plot for a KATRlN-like experiment, clearly demonstrating the 
dominating dependence of the sensitivity on the signal countrate. 

Further we have learned that for a suitable mass-squared difference an 
experiment such as KATRIN should be able to detect the existence of other 
neutrino mass states. And finally we have re-evaluated the influence of couplings to 
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righthanded currents in the tritium beta decay and found that ignoring this would 
maximally induce an error on the neutrino mass of order 10%. 

In conclusion we find that our bayesian approach to the analysis of the KATRIN 
experiment is certainly competitive to a frequentist approach and that it has 
several advantages over it when using an already well-developed framework such 
as COSMOMC. 
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